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Abstract 

A positive mass theorem for General Relativity Theory is proved. The proof is 4- 
j> . dimensional in nature, and relies completely on arguments pertaining to causal structure, 
the basic idea being that positive energy-density focuses null geodesies, and correspond- 
ingly retards them, whereas a negative total mass would advance them. Because it is not 
concerned with what lies behind horizons, this new theorem applies in some situations 
not covered by previous positivity theorems. Also, because geodesic focusing is a global 
condition, the proof might allow a generalisation to semi-classical gravity, even though 
O" 1 quantum violations of local energy conditions can occur there. 
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I: Introduction 



Fifteen years ago, the construction and proof of a suitable positive mass theorem for Gen- 
eral Relativity remained one of the major unsolved problems of the theory. The first proof 
of such a theorem was given by Schoen and Yau/ 1 ) as the culmination of a considerable 
effort by several workers. The proof they obtained was mathematically sophisticated and 
not easily accessible to many physicists. Shortly thereafter, Witten^ 2 ) found a method of 
proof which could be given in easily understood terms, though the rigourous mathematical 
justification of the method had to await subsequent work by other authors. ( 3 ' 4 ) 

One might pose two questions concerning these efforts, the first pertaining to the 
generality of the theorems proved. It would seem unreasonable to expect that a theorem 
could be proved even when the spacetime is nakedly singular (after all, there is the negative 
mass Schwarzschild solution) or when the non-gravitational energy of the matter present 
is mostly negative (for one expects gravitational binding energy to be negative as well, so 
how could the sum be positive?) . Yet one still might ask whether the assumptions required 
for the existing proofs could be weakened or altered, say by relaxing the condition of local 
positivity of energy or by finding a proof which holds on spacetimes that lack complete 
spacelike hypersurfaces or do not admit Sen-Witten spinors. It will be seen that, in fact, 
all these conditions can be relaxed, although other conditions (such as the so-called generic 
condition, a particularly mild assumption in the present context) have to be added. This 
brings one to the second and related question, which is more aesthetic in nature. If one is 
interested in identifying those features of an asymptotically flat spacetime which lead to a 
positive total mass, one might like to have a theorem whose proof relies as clearly and as 
deeply as possible on physically intuitive concepts. The second question, then, is whether 
there exist positive mass theorems which make more explicit use of what one might feel 
are the physical mechanisms that act to ensure that the mass is positive. The theorem 
presented herein is hoped to be of this nature. 

The idea is to show that negative mass spacetimes are inherently inconsistent. The 
key to the contradiction is the behaviour of null geodesies in gravitational fields. In 1964, 
Shapiro^ 5 -* noticed that light rays emitted at an infinitely distant point and passing near 
a positive mass should be delayed with respect to rays passing farther from the mass, and 
that this effect could be used as the basis for what is today a very accurate test of General 
Relativity. ^ For present purposes, it is the complementary observation that is of primary 
interest, specifically that the rays that pass infinitely far from a positive mass "source" 
are infinitely advanced with respect to rays passing nearby the source. In the negative 
mass case, as one would expect, the distant rays are infinitely delayed with respect to 
the nearby ones. From this effect one may infer that, in the negative mass case, there 
are points in arbitrarily distant future regions of spacetime which are not causally related 
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to other points in arbitrarily distant past regions. In any such situation, there will exist 
null geodesies which traverse the boundary between causally related points and unrelated 
ones, and in this case it will be possible to show that one such null geodesic does so for an 
infinite affine distance. (This geodesic will pass nearby or through the source, even though 
its existence is a consequence of the behaviour of null geodesies passing arbitrarily far from 
it.) 

Yet it is equally possible to argue that this null geodesic cannot exist. On one hand, 
it cannot have a pair of conjugate points since, if it did, it necessarily could not remain on 
the boundary between causally related and unrelated points for more than a finite affine 
distance. (Recall that conjugate points are pairs of points where some Jacobi field — which 
measures the separation of neighbouring curves of a congruence — vanishes/ 7 ' 8 -' When this 
happens, focusing of the congruence is said to have occurred. See fig. (1).) But on the other 
hand, the existence of pairs of conjugate points along every null geodesic is a consequence of 
the focusing theorems^ 9,10,11 ) which play such a central role in the proofs of the singularity 
theorems/ 7 ' 8 -' and which are now known to hold under very general conditions. One thus 
obtains the desired contradiction in any situation in which a focusing theorem can be 
proved. 

The conditions assumed by most focusing theorems express two ideas. One is that each 
null geodesic should encounter curvature which produces a converging or shearing effect 
on the cross-section of a congruence of neighbouring null geodesies; * the other is that it 
should not encounter enough negative energy matter to prevent the ensuing appearance 
of conjugate points. It is also necessary that the null geodesic of interest should traverse 
a singularity-free region of the spacetime, which in particular guarantees that it can be 
extended far enough to develop conjugate points. This is insured by requiring that the 
subset of spacetime which is outside all event horizons be globally hyperbolic. Incidentally, 
this form of causality condition means that the positivity theorem will apply in spacetimes 
with causality violations, provided those violations are hidden behind black hole horizons 
(or, for that matter, inside white holes). The only other conditions that will be imposed 
pertain to asymptotic flatness. Asymptotic flatness is first of all used in order that the 
notion of energy may sensibly be defined, but the correlative existence of geometrical 
structures at infinity will also prove to be an effective technical tool in the proof. 

* Of course, a generic curvature tensor will produce both shearing and convergence (or 
divergence if the energy density is negative) , but even pure local shearing ( "tidal shearing" ) 
will produce convergence at second order. Thus the condition means physically that the 
geodesic passes through a converging gravitational lens. 
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The theorem thus obtained is quite different in character from the earlier theorems, 
which have a distinct 3 + 1 flavour. ** With those theorems, one relies on the conservation 
of energy to argue that, since every hypersurface that asymptotes to spatial infinity will 
carry the same mass, the positivity proof may proceed entirely on a single hypersurface; 
and, in fact, the arguments used are most often couched in terms of Cauchy data for a 
spacetime, rather than in terms of the spacetime itself. The present proof, in contrast, 
is global in nature and is quite reminiscent of the sort of argument used to prove the 
singularity theorems. It is noteworthy that both primary ingredients of the proof, namely 
these global techniques and knowledge of the time delay /advance effect, were available 
long before any of the existing positivity theorems were proved. 

The fact that positive energy is derived herein from focusing may be important in 
connection with the question of the semi- classical stability of Minkowski space. The usual 
argument for classical stability (and also stability against quantum tunnelling) relies on 
the positivity of mass as proved for stress-energy tensors that obey local energy conditions 
or energy conditions integrated over spacelike surfaces. Such energy conditions in general 
fail for quantum matter (or quantum gravitons) coupled to a classical metric, and so the 
usual type of argument fails semi-classically. However, it is conceivable (and presently very 
much an open issue) that energy conditions integrated along null geodesies may hold in 
appropriate circumstances, even in the presence of quantum matter. If such semi-classical 
focusing theorems could be established, they would lead to an extension of the positive 
energy theorem presented herein which would be valid in the context of the semi-classical 
Einstein equation. 

In the presentation that follows, Section II. 1 states the theorem in a general form, 
relying on the assumption that null geodesies focus; i.e. develop conjugate points. Section 

II. 2 discusses sufficient conditions under which every infinite null geodesic will in fact focus, 
these being the conditions alluded to above. Section III discusses the behaviour of null 
geodesies propagating in the asymptotic regions of spacetime. The utility for this purpose 
of solutions of the Hamilton- Jacobi equation is the subject of Section III.l, while Section 

III. 2 contains the calculations themselves. One can show that an important property of 
null geodesies from a fixed initial endpoint at past null infinity is governed by the sign of a 
certain combination of the asymptotic metric coefficients — if this quantity is positive, the 
geodesies form a sequence receding into the distant future and, if it is negative, they form 
a sequence converging upon spatial infinity. The quantity that governs this behaviour is a 

** This flavour is less pronounced in the case of the Witten proof which, although still pro- 
ceeding on a spacelike hypersurface, can be presented in a form where all the basic quan- 
tities which appear (including the superpotential for the ADM energy) are 4-dimensional 
in character^ 4 ) . 
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component of the spacetime's 4-momentum, as shown in the appendix. Section IV gives 
the formal proof of the positivity theorem. Section V points out some purposes for which 
the present theorem is more general than the others, and contains some speculations on 
possible further generalisations of the result, particularly concerning the cases of null and 
zero 4-momentum. 

Many of the ideas herein grew out of discussions following a seminar given by one 
of the authors (R.P.) at Syracuse University in the Spring of 1990. Subsequently, two 
different but related approaches to proving the result described in this paper were found 
and have been reported elsewhere.^ 12 ' 13 ) This work is intended to give a full proof of the 
results stated in those previous works and to generalise them. For an early statement of 
some of the ideas behind this proof see Ref. (14) and the related ideas in Ref. (15); for an 
alternative development of part of the work, see Ashtekar and Penrose/ 16 ) 

The conventions adopted are those of the Landau-Lifshitz Spacelike Convention used 
in Refs. (7, 17, 18). Specifically, the signature of the metric is taken to be ( — , +, +, +), and 
the Ricci tensor is defined so that it is the positive sign of timelike and null components 
of R a b which leads to focusing (see Subsection II.2 for a more complete discussion). Small 
roman letters are used for abstract indices, while small greek indices refer to components 
in chosen coordinates, usually some quasi-Cartesian system near infinity as in Section 
III. Spacetime means a manifold (Hausdorff, paracompact, 4-dimensional except where 
higher-dimensional generalisations are discussed) with a Lorentzian metric; it may possess 
singularities (provided they are suitably attired; again, see subsequent discussion). A 
causal curve in spacetime will generally refer to a curve-without-parametrisation — the 
image of a continuous Lipshitz map from an interval of IR into spacetime (the Lipshitz 
condition is automatic for causal curves). The chronological (resp. causal) future I + (X) 
(resp. J + (X)) consists of all points joined to the point X or to points in the set X by 
future timelike {resp. future causal — future timelike or null) curves from X. Single points 
are considered curves of zero length, and therefore p G J + (p) but p ^ I + (p)- Lastly, A\B 
denotes the complement of the set B in the set A, i.e. those points of A that are not in B. 

II: Focusing and Positivity 

II. 1: Statement of the Theorem: 

The result to be proved refers to spacetimes which are asymptotically flat in a sense to 
be made precise in Section III (see especially equations (III.2.1) and (III. 2. 2) and the 
discussion following equation (A. 3) in the appendix). The spacetime manifold Ai, with 
Lorentzian metric g a i, will be regarded as embedded in a larger manifold M. (with con- 
formally related metric g a b) in order that one may refer to points at infinity in the usual 
manner. ( 18 ) Thus, one defines the boundary J of M in M, and its disjoint subsets, T + 
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(X~) = future (past) null infinity and {i } = spatial infinity. Note that M. is taken to 
extend slightly "beyond infinity" in order that it can be a manifold — albeit not a C°° 
one — even at i°. 

Notice that X is the union of the null geodesies* emanating from i°. Notice also that 
X represents only a single asymptotic region of A4; if there are others, they will be ignored, 
and only the mass associated to the one whose "boundary at infinity" is X will be studied. 
Thus X + , for example, consists of the ideal endpoints of out-going null geodesies which 
reach arbitrarily great radii in the asymptotic region in question. One also defines the 
domain of outer communications V = I + (X~) n I~(T + ) (the set of events which "can 
communicate with the asymptotic region"; an example is depicted in fig. (2)). It will be 
assumed that V satisfies the following conditions: 

• Every infinite null geodesic in V possesses a pair of conjugate points. 

(In fact, this condition can be weakened to one pertaining only to achronal null geodesies, 
as will be seen later.) 

• VUl is globally hyperbolic as a subset of M; a globally hyperbolic set being one which 
is strongly causal and which contains ((p,q)) '■= J + (p) H J~(q) as a compact subset, 
for each p and q in the set. 

The conditions for asymptotic flatness will be taken to hold on the intersection M of some 
neighbourhood of X with T>. One has then: 

Theorem II. 1.1: The 4-momentum of Ai is future-causal. 

Equivalently, the theorem states that the ADM energy is non-negative in every rest- 
frame. Notice that the theorem as presented above does not explicitly assume Einstein's 
equations. Rather, those features of the curvature which are essential for the theorem to 
hold have been extracted and posed as the assumptions. Positivity of mass emerges as a 
property of general Lorentzian manifolds that focus null geodesies and have an adequate 
asymptotic flatness property. Upon assumption of Einstein's equations, one can replace 
the conditions on the curvature by conditions on the matter tensor. In the next subsection, 
it will be seen that reasonable conditions on the matter tensor act to enforce the conjugate 
point condition. It may also be appropriate to mention here that such causality conditions 
as will be needed will apply only to V; there is no need to forbid, for example, closed 
timelike curves hidden behind event horizons. 



* That is, null geodesies with respect to g a b- Recall that conformally equivalent metrics 
possess the same null geodesies. 
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112: Focusing Theorems and the Existence of Conjugate Points: 



The condition requiring the existence of conjugate points may be imposed by assuming 
instead other conditions which suffice to imply the necessary focusing, as mentioned in the 
introduction. The traditional way to do this has been to require that the energy density be 
pointwise non-negative, and to assume in addition that the curvature tensor satisfies the so- 
called generic condition defined below. For details, the reader may consult Refs. (7,8,19). 
However, recent work shows that geodesic focusing occurs under conditions considerably 
more general than originally known. An example of such conditions would be that along 
every infinite null geodesic the integral condition sometimes (somewhat inappropriately) 
called the Averaged Weak Energy Condition hold, with every such geodesic encountering 
a point of generic curvature. Similar conditions have been used in recent discussions 
of causality violating spacetimes*- 20 -' and are drawn essentially from work of Tipler^ 9 ). 
However, even Tipler's conditions can be weakened, as was shown by Borde^ 10 ), who 
proves the following theorem. 

Borde's Focusing Theorem 2: Let 7 be a complete causal (i.e. null or timelike) 
geodesic with affine parameter t and tangent T a ; and let T[ a Rt,] c d[ e Tf]T c T d 7^ 
somewhere on 7. Suppose further that for each e > there exists B > such 
that, for every pair of affine parameter values t\ < £2, there exists an interval I\ 
with length > B and endpoints < ti, and an interval I2 with length > B and 
endpoints > £2, such that 

t" 

J R ab T a T b dt > -e V£' e h , V£" e h ■ 

v 

Then 7 contains a pair of conjugate points. 

Also, Kannar* 11 ) recently has produced a variant of this theorem which imposes an integral 
condition directly on components of the Riemann tensor instead of on components of the 
Ricci tensor. 

A spacetime in which T[ a R b ] cd [ e Tf]T c T d 7^ somewhere on every geodesic (with T a 
tangent to the geodesic) is said to obey the generic condition. For present purposes, 
it suffices that this hold on every null geodesic, and the word generic will be used in 
that sense. Satisfaction of this condition is not at all a strong assumption, being indeed 
generic in a suitably rigourous sensed 19 ' 21 -' This last fact means in particular that, even for 
spacetimes which do not satisfy the condition, there should be nearby generic spacetimes 
with almost the same total mass. Thus a proof of positivity of mass for generic spacetimes 
should imply the same result for arbitrary ones. 
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The motivation for studying general conditions under which focusing might occur 
came from attempts to analyse the singularity theorems (and more recently the so-called 
chronology protection conjecture'- 22 '' forbidding formation of closed timelike curves) in the 
context of classical gravity coupled to quantum fields (including the graviton field). *- 23 - ) 
The present work raises in addition the possibility of an application to proving a positive 
energy theorem in this same "semi-classical" context. 

If integral energy conditions of the sort used in Borde's theorem were to hold for 
arbitrary states of arbitrary quantum fields in arbitrarily curved spacetimes then the proof 
of positivity of mass presented herein would generalise straightforwardly to the case of 
semi-classical gravity. Although some encouraging results in this direction are available** 
(in particular the theorems of Refs. (24, 25) and the indications from recent work of Ford 
and of Roman that the uncertainty principle places strong constraints on the negative 
energy distributions arising from quantum fields in fiat and certain curved spacetimes^ 26 )), 
it now appears that integral energy conditions can be violated by test quantum fields 
in certain curved spacetimes/ 24 ) However, no such violation is known in the presence of 
the semi-classical Einstein equation, and even in situations where one would expect that 
approximate equation to fail (such as for an exploding black hole) the obvious candidates 
for "no n- focusing" null geodesies turn out, on closer consideration, to be likely to possess 
pairs of conjugate points. 

Finally, it is noteworthy that focusing theorems can be generalised to certain metrics 
that fail to be C 2 , for example ones with shock wave discontinuities in the curvature or 
the connection/ 27 ^ This makes it likely that a version of Theorem II. 1.1 will still hold in 
such cases, thereby providing a positivity result appropriate to such generalised solutions 
of Einstein's equation. 



** In this connection, it might prove important that the proof given in Section IV below 
deals with a null geodesic which is achronal by construction. In deriving the required 
contradiction it therefore suffices to know that every achronal null geodesic has a pair 
of conjugate points (and therefore is self-contradictory). Hence it would suffice to have 
available an integral energy condition known to hold only for achronal null geodesies, as in 
Ref. (24). Notice in particular that although, for compact spacetimes, the Casimir energy 
may be negative, this need not imply that the integrated energy conditions fail on achronal 
null geodesies. 
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Ill: Null Geodesies in a Neighbourhood of Infinity 

III.l: The Phase Delay and the Retarded and Advanced Time Functions 

In 1960, Plebahski^ 28 ) published a study of the behaviour of null geodesies propagating in 
0(l/r) weak gravitational fields. He used his results to compute light bending, but did 
not explicitly note the time delay effect, nor did he comment on the complementary effect, 
i.e. the infinite time advance of null geodesies as one approaches infinity in a positive mass 
spacetime. His method of solving the Hamilton- Jacobi equation perturbatively about a 
flat background metric (namely that entailed by asymptotic flatness) will be employed 
here. The analysis of this section will be valid only on a suitable neighbourhood of infinity, 
one on which an appropriate background structure may be defined, as detailed in the next 
subsection. 

Fix a generator A - of X~ , let p be some point on A - , and consider the set of all null 
geodesies originating at p. This set is a subset of the non-rotating congruence of all null 
geodesies whose initial endpoints lie on A - . Since this is a non-rotating congruence, it can 
be described by a single scalar function S, which is known by many names, the Hamilton- 
Jacobi function, the eikonal, the phase (of the waves carried along by the congruence), etc. 
It obeys 

g^dpSdvS = . (III.l. 1) 

Its significance is that the null geodesies leaving A~ generate the level surfaces of S : so 
knowledge of S defines in a succinct manner the set of null geodesies from each initial 
point p G A - . When S is normalised so that it reduces on A - to a suitably defined affine 
parameter, it will be called the advanced time function and denoted by S~ . 

Consistent with asymptotic flatness, let there exist quasi-Cartesian coordinates {x^} 
in a neighbourhood of infinity, by which is meant coordinates in which the (inverse) metric 
takes the form 

g »u = r pv + h pu ; h» v = 0(l/r) , (III.1.2) 

where t = x , r 2 = x^x^ (with j running from 1 to 3) and rj^ 11 is the (inverse) Minkowski 
metric, having components diag(— 1, 1, 1, 1). The precise sense in which 0(l/r) is to be 
understood will be described shortly. Then one may expand S about its Minkowski space 
form as 

S = r ]llv k lx x v + AS , where r)^ v k»k v = , (III.1.3) 

with k^ having constant components in the x M system. The Hamilton-Jacobi equation 
(III. 1.1) becomes 

k^d^AS = k il k v + £{x^ , k^ := f]^ u k , 

\ (III.1.4) 
e{x») = - -{rfd^Sdv&S + h^d„ASd u AS + 2h^k^d u AS} , 
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where S represents an error term of second order in hP' v which will vanish like l/r 1+e 
or faster as r — > oo. Note the definition of which implies that it also has constant 
components. Neglecting the error term, one may integrate equation (III. 1.4) to yield 



where n J = /c J //c° is the unit vector in the /c J -direction and C is a constant of integration. 
When to is taken to negative infinity with a suitable choice of integration constants,* 
S = S~ describes a plane wave emanating from the generator A - of I~ . Note that the 
support of the integrand in equation (III. 1.5) is along the Minkowskian-null line through 
(t,x J ) parallel to k^. In what follows, k^ will be taken to have components (1,0,0,1), so 
that ri^ v k^x v = z — t describes a plane wave propagating in the ^-direction. 

Analogous to S~ is another function, the retarded time function, S + which will also 
be needed in what follows. It furnishes a function on spacetime which is constant along 
each null geodesic whose future endpoint lies on A + , the generator of X + that "continues" 
A - . This function also obeys the Hamilton-Jacobi equation (III. 1.1) and the resulting 
equations (III. 1.3) with a AS given (approximately) by (III. 1.5) with the same k^ but the 
integral taken from t to oo rather than from — oo to t. 

An important caveat is that S + (like S~) is in general not everywhere well defined. 
The problem is that null geodesies focus, so two or more null geodesies that have different 
endpoints on A + may pass through some given point x in spacetime, assigning to that 
point two or more different values of S + . However, these values always have a lower bound, 
since null geodesies from any fixed x in spacetime cannot arrive at I + arbitrarily early. 
The greatest lower bound of S + at x is physically significant in that it gives the earliest 
(retarded) time of arrival at A + of any null geodesic (and hence any signal whatsoever) from 
x. In what follows, however, one is able to limit consideration to null geodesies contained 
within the asymptotic region, and for them the (obviously single-valued) approximation 
to S given by (III. 1.5) will suffice for present purposes** (in fact, all that is needed is that 

* In order that AS have a limit as to —> — oo, C(to) generally will have to diverge loga- 
rithmically with to, cf. equations (III. 2. 9) and (III. 2. 11) below. Note that S~ can also 
be described as that solution of the Hamilton-Jacobi equation which agrees on A - with a 

certain Bondi time coordinate, as defined in Ref. (29). 
** Of course, from a fixed point of I~ , unphysical (but causal) signals may reach any gener- 
ator of J + arbitrarily early by passing through i°. Less trivially, one may see that causal 



t 




(III.1.5) 



to 



S^(x j -x j ' -(t-t')n j ) + C(t ) , 
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the error made by using (III. 1.5) be uniformly bounded on neighbourhoods of infinity of 
the sort discussed in Section (III.2)). 

It is worthwhile to interpret the preceding equations before proceeding. The phase of 
a hypothetical (massless) plane wave that couples only to the background flat metric is 
n^k^x" . Equation (III. 1.5) may be thought of as giving the phase delay experienced by a 
wave propagating through a refractive medium, following to zero th order of approximation 
the Minkowskian-null direction but experiencing refraction due to its coupling to h^ u . 
To compute null trajectories to first order of approximation, one must note that photons 
propagate so as to keep the true phase S of equation (III. 1.3) constant, not the background 
phase, so one must consider equations (III. 1.3) and (III. 1.5) together. With the choice 
of k^ as having components (1,0,0,1), the equation S = const, determines a surface 
of constant phase ruled by null geodesies that would be described in 3+1-language as 
normally outbound from a coordinate 2-plane z = zq in the asymptotic region at t = to. 
Then AS describes the amount by which this surface differs from one ruled by lines null in 
the background metric n^. Thus, one may use AS to extract the time delay^ 5,6 ^ (possibly 
negative) due to h^ v , experienced by a plane wave evolving in the asymptotic region from 
the initial plane surface. To see this, one may ask at what value of coordinate time t 
does a generator of the S = const, surface reach a point of fixed spatial coordinate x l , 
and what would be the value of t if h tJlV were neglected. The resulting time delay is given 
by At = (l/k°)AS, where k° is the projection of /c M onto the normal to the surfaces of 
constant time. If this normal is = <9 M £, one may write the formula 

Ai = #^ • < IILL6 > 

describing the Shapiro time-delay effect. This is somewhat implicit, as AS depends on At 
but, consistent with the first approximation, the final time required in the evaluation of 
AS may be taken to be the time at which the Minkowskian-null line tangent to k^ has 
spatial coordinates x 1 . 

Note that there is an interpretation in physical terms associated with the use of the 
first order approximation described above. It is the approximation in which account is 
taken of the time delay experienced by null geodesies, but no account is taken of their 



curves in spacetime may reach A + arbitrarily early from any point of X~\A~ by smoothing 
out the aforementioned unphysical curves so as to avoid % , and the retarded time thus is 
unbounded below on approach to such points. See particularly the discussion of equation 
(III. 2. 19), the remark after the proof of Theorem II.l.l in Section IV, and fig. (3). Com- 
munication involving physical signals from A~ to A + is key to our argument; for this case, 
see Lemma III. 2.1. 
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consequent bending. Such an approximation clearly is valid for geodesies which propagate 
entirely at very large radii; we intend to provide a more rigorous justification for its use in 
another place. 

III.2: Behaviour of the Retarded and Advanced Time Functions Near Infinity 

The aim of this subsection is to evaluate the time delay/advance for null geodesies propa- 
gating near infinity. One begins with a formulation of the conditions of asymptotic flatness 
that will be required and which will refer to some neighbourhood of infinity, J\f C J\A. It 
is most direct to impose conditions on the inverse metric, namely that there be a quasi- 
Cartesian coordinate system for M in which it may be written as 

g n» = V ^ _ 4 >" K± I ) + M „ ? (IH.2.1) 
r 

where r 2 = x 2 + y 2 + z 2 . The function m llu describes what will be called the DC part 
of the asymptotic metric; it is effectively a function on the cylinder S 2 x IR, and will be 
required to be C 1 and bounded. It is intended that ip^ v describe, to leading order, the 
radiative part of the metric, so it should fall off appropriately and satisfy some manner 
of transversality condition. It will be required, specifically, that ip^" be G(l/r), and 
that (p^l^lu be 0{l/r 1+e ) for some e > when Z M is taken to be either the out-going, 
coordinate- null vector Z+ = <9 M (r — t) or its in-going counterpart l~ = <9 M (r + t). 

We now introduce a precise definition for the symbol 0(l/r). This definition entails 
in particular a fall-off which is uniform on neighbourhoods of compact portions of X. 
A function / on M will be said to be of class 0(l/r^) with respect to an asymptotic 
coordinate system iff, for each function h(r) of the class r + o(r), there exists a bound 
B > such that \f(x)\ < B/r 13 for all x e N such that \x°\ < h(r). (By a function of the 
class r + o(r) is meant one for which \h(r) — r|/r ^ as r ^ oo, for example the function 
h(r) = r + a\og(r/a) + T.) Thus, in the cases of ip^ u and m^ u one has for such x, 



\(f^ u \ 


<^ , 
r 


(III.2.2a) 


1 nvi±i±\ 
\^ > v \ 


^ r (l+e) ' 


(III.2.2b) 




<K 3 , 


(111.2.2c) 



for any convenient choice of norm | • | (e.g. the supremum over the components in the 
x M basis, the root-sum-square of the components in this basis, etc.) and some constants 
Ki > 0. 

Notice that the conditions for asymptotic flatness are Poincare invariant: independent 
of the choice of origin which defines the radial direction and also independent of the choice 
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of asymptotic rest frame. They are weak enough to encompass, for example, the DC fields 
and radiation emitted by any astrophysically plausible source we know of. (Our particular 
"gauge choice" of asymptotic background metric has the disadvantage that r =F t is not a 
good coordinate on but the change of coordinates one would need to make it so would 
introduce implicitly defined logarithmic terms into the asymptotic metric, a complication 
we wish to avoid.) 

The proof to be given in Section IV will proceed entirely within a certain asymptotic 
region WCAfUl upon which a single set of bounds Ki on the fall-off of W holds sway. 
One may now estimate the retardation function AS under the assumption of such fixed 
bounds. Since equation (III. 1.5) is linear in h^ u , the contribution to AS from each of the 
two 0(l/r) terms in the metric (III. 2.1) may be computed separately. Hence, the retarded 
time function at x, corresponding to the generator A + C X+, is 

S+ ( x ) =t-z + AS + (x) , (III.2.3a) 

oo 

AS+( X ) = _I J dt' J d 3 x' k^k v h^{t',x') 8^\x-x' -(t-t')n) , 
t 

=AS+ c (x) + AS+(x) , (III.2.3b) 

oo 

AS+ c (x) =2 j dt' j d 3 x' k » K ™^ 6^(x-x' -(t- t')h) , (111.2.3c) 



t 

oo 



AS+{x) = -^ J dt' J d 3 x' k^k v ^ u {x') 8 {3 \x-x'- (t-t')n) , (III.2.3d) 



t 



with k° normalised to unity, and the (possibly infinite) integration constant coming from 
C(t ) in (III.1.5) being left implicit in (III.2.3c). 

Anticipating the answer, the DC piece will produce an unbounded contribution, so 
it will suffice to show that the (p^ contribution is bounded, and therefore negligible in 
the appropriate limit. It will be useful to characterize null geodesies (in A/", as are all 
null geodesies discussed in this subsection) by an impact parameter, defined in the usual 
manner with respect to the asymptotic coordinates as the minimum value of 

b 2 = x 2 + y 2 . (III.2.4) 

along the geodesic. 

Let = ±/^, where ± = sgn(z), so corresponds to the radial null direction, 
inbound for z < and outbound for z > 0, and with Iq = —1. Define 

^ = ^" /m ' (III.2.5) 
k^k v ^ v =l^ v + 2l li s v <p t "' + Sfl s^ . 
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Equations (III. 2. 2) may be used to bound the magnitude of each term in this expression, 
since the entirety of the path of integration lies within an asymptotic region where a single 
set of bounds Ki on the fall-off of holds sway. Hence, using s M = (0, s), one may write 
that 

oo 

5 (3) (x-x'- 

oo 

5 (3) (£-x'-(t-t» • 

The are constants determined by the Ki and the choice of norm. As well, from the 

definition of s M , one obtains 

\s\ 2 = 2(1 -\z\/r) , (III.2.7) 



(III.2.6) 



giving 



2 

|ASj(z)|<^ | (cosa)- 1 ** + cb^^-Tui- 1 ^ , (III.2.8) 

Tan-^f) 

with c 5 = C4/V2 and with tana = z'/6. 

A more succinct expression may be obtained by extending the geodesic back to t = 
—00, which yields 

7T 

2 

\&S+(x)\ < J (cosay- 1 da + c 5 (k)tt . (III.2.9) 

7T 

2 

Thus the "radiative contribution" to the retardation of a null geodesic from x e jV to A + 
is uniformly bounded in x, except possibly in the limit 6^0. Small 6 will not be relevant 
in the arguments that follow. For now, the chief lesson is that, in the case of large 6, the 
contribution to the phase delay (or advance) due to (p^ v is bounded, as claimed. 

Next, the contribution from the DC piece is required. Here the definition 

m^(x x /r) = m(r, 0, fj) , (III.2.10) 
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will be useful, with r = t/r, [i = z/r and tan6> = y/x. The integral (III. 2. 3c) reduces to 

oo 

m _'^'g , (1.1.2,!) 

t 

with the primes in the argument of m denoting evaluation of r and \i at the translated 
(retarded) value z — t + 1' of the ^-coordinate (9 is independent of z) . The problem now 
is that generally this integral diverges; it is essentially the e = C4 = case of equations 
(III. 2. 6-9). This divergence as such merely expresses the fact that, with our choice of 
asymptotic coordinates, the Minkowskian null lines along which we have evaluated A5" 
arrive at A + at infinitely late (or infinitely early as the case may be) retarded times - 
that is to say, they actually miss A + entirely. In the well known case of Schwarzschild, for 
example, the true out-going null surfaces take the form r + alog(r/a) —t = const., which 
diverges logarithmically from the null surfaces defined by rj^. As mentioned earlier, it is 
because of this effect that (III. 1.5) requires a to-dependent constant of integration, which 
reappears as a "renormalization" of (III. 2. 3c) and (III. 2. 11). 

However, what is important here is the difference in arrival time at A + of null geodesies 
with different impact parameters, and (to leading order) such differences can be found 
directly from the integral in (III. 1.5) or (III. 2. 11) because the value of the integration 
constant drops out. This is the procedure that will be followed. ^ Accordingly, consider 
a family of null geodesies with the same initial z = zq and t = to, and the same 9, but 
differing initial b. One may compute the phase difference between neighbouring finite- 
length geodesies specified by b and b + db, then let the final endpoints go to infinity, and 
lastly integrate the resulting value of dS + /db to obtain the 6-dependence of S + . (Along 
all these null geodesies, the same bounds Ki on the fall-off rates hold.) Thus, taking two 
points along a null line, one with coordinates (to, b, zq), the other with coordinates (t, b, z) 
(t > t say), and comparing the phase shift to that along the null line joining (to, b + db, z ) 
to (t, b + db, z), one obtains the result 

= + 0(l/b 2 ) . (III.2.12) 

db br 



t Another way to describe this procedure is as follows. One wants the difference S + (x'q) — 
S + (x' )- To find it, one propagetes null geodesies 7' and 7" from x and x ' respectively, to 
A + (i.e. exact null geodesies with respect to the true metric g a b)- When 7' and 7" reach 
the far asymptotic region, r — > 00, they are running parallel to and the difference in 



their /c^-normalised time of arrival at A + is given by the phase difference k^(x' — x")^, 
where x' and x" are points on 7' and 7" in the far asymptotic region. To leading order 
this phase difference is just that due to the difference in starting phase, k^(x' Q — x'q)^, plus 
the difference in the phase delay integrals (III. 1.5) computed from x' and x' Q ' to endpoints 
of equal z (or t) in the far asymptotic region. 
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Here m + denotes the value of m at the point having coordinates (t, 6, z) . In passing, it may 
be remarked that the 0(l/b 2 ) terms come from two sources, one being the initial endpoint 
contribution 2z§m~ /br = 0(l/b 2 ), the other being 6-dependence of the arguments of 
m(r, 6, fx). 

In the limit as the final endpoint goes to infinity, equation (III. 2. 12) becomes 

^ = -^ + 0(l/6 2 ) , (IIL2.13) 

where M + , the limiting value of m + , is a constant (one might have expected it to retain the 
^-dependence, but 6 parametrises a circle whose angular size at fixed b becomes negligible 
as its centre approaches infinity). In terms of /c M one has from (III. 2. 10) that 

M+ = limm+ = m^(k x /k°) k^k v . (III.2.14) 



Now take z = 0, in which case (since M + is independent of 9) one may treat S^ c 
as a function of b alone. Then, integrating (III. 2. 13) and adding in the contribution AS* 
(which has already been seen to be bounded, by (III. 2. 9)) yields 

S+(b) = -2M+ log(6/6 ) + 0(1) , (III.2.15) 



where bo is a constant of integration and 0(1) indicates terms bounded in the large b limit. 
A similar analysis gives the advanced time function at z = (and sufficiently large b) as 

S~ (b) = +2M~ log(6/6o) + 0(1) , (III.2.16) 

with M~ = m^(-k x /k°) k^k v . 

Finally, for curves that have their initial endpoints on A - and final endpoints on A + , 
one may define a time- of- flight as the difference between the (retarded) time of arrival and 
the (advanced) time of departure. In terms of this concept one may state the following 
lemma, which will be used in the proof in Section IV. 

Lemma III. 2.1: Let 7 consist of the union of two null geodesies, one that joins 
A - to a point w G M with z = and radial coordinate 6, and one that joins w 
to A + . For a fixed set of bounds k% in (III. 2. 2), there exist positive constants Rq 
and Co such that, if 7 remains always at radii greater than Rq then, to within 
a correction of magnitude less than Co, its time-of-flight from A - to A + is given 
by 

-2(M+ + M-)log(6/ J R ) • (III.2.17) 
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Proof: Consider a neighbourhood jV(k) of a compact portion of infinity on which 
the inequalities (III.2.2) hold with some triple of fixed bounds k = and let 
w E ftf have z = 0. If the radial coordinate at w is 6, then the functions S ,± (w) 
may be computed as in this section, provided the Minkowskian-null trajectory 
z — t = const, through w remains within jV(k), and it is easy to choose jV(k) so 
that this is so. Then, from (III. 2. 15) and (III.2.16), one has 

S+(w) - S~(w) = -2(M+ + M-)\og(b/R ) +0(1) . (III.2.18) 

The lemma follows. □ 

In interpreting the time-of-flight, recall that the normalisations of S ± are both fixed 
by the null vector k a which determines the generators A 1 * 1 of T^. It is possible to define 
time-of-flight for curves joining arbitrary generators. Consider for example the generator 
r + C X + associated to the null vector + cos# + sin#^:. The time-of-flight of the curve 
that leaves A~, follows a null geodesic to a point w with coordinates (6, z), then follows 
another null geodesic to T + is given by 

S + (w)-S-(w) = -2(M + + M-)log(&/# )-2M+log(l + ^tan6>) +0(1) . (III.2.19) 

For non-zero 6>, the point w may be adjusted to make this quantity arbitrarily negative, 
whence such curves may join arbitrarily late points on A - to arbitrarily early ones on r + , 
as remarked in an earlier footnote. See also the illustration in fig. (3). 

IV: The Positivity Proof 

Now it is possible to prove the theorem stated in Section II. 

Proof of Theorem II. 1.1: Choose a future-pointing asymptotic null vector k a , as 
before. This vector determines a unique null geodesic generator A + of X + and (the 
past-pointing null vector — k a determines) a unique null geodesic generator A~ of 
X~ . Recall that, by assumption, there exists a neighbourhood of infinity M and 
a quasi- Cartesian coordinate system x M for M in terms of which the spacetime 
metric fulfils the asymptotic flatness conditions introduced in Section III. 
Recall also that VUl = [J _ (X + )n/ + (X _ )]UX is assumed to be globally hyperbolic 
as a subset of M. (fig. (2)). We seek a "fastest causal curve" 7 from A - , through 
X>, to A + . The curve 7 will be constructed as the limit of a sequence of causal 
curves 7 n . 

In order to obtain 70, the initial curve of our sequence, let us begin by 
choosing xq G V and a causal curve from xq to qo G A + . (Such a curve is 
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easily constructed; one can, for example, pick x$ in the asymptotic region Af 
and proceed from there in a null direction coinciding asymptotically with k a .) 
Similarly, we can find some causal curve joining x$ to a point po G A - . All our 
considerations henceforth will be confined to the interval ({po,qo)) '■= J + (po) H 
J~(qo), by definition a compact subset of the globally hyperbolic set DUl 
For future reference, let us introduce also a radius i?o large enough so that all 
points x G ((po>9o)) with (finite) radial coordinate r(x) > Rq belong to Af, and 
let U = {x G ((pO)9o)) \ r(x) > Ro}. (Here we also admit r(x) = oo, so that 
((PO) qo)) nlC A"U {i } U A + is included in U.) Clearly, U is open in ((p , <?o)) 5 
whence its complement, K, := ((p , q ))\U : is a compact subset of £>. 

Now what will be called a "faster causal curve" than 70 from A~ to A + will be 
one which departs (weakly) later and leaves (weakly) earlier than 79. Such a curve 
will by definition have its initial and final endpoints in ((po,i )) and ((i ,qo)) 
respectively. In order to home in on a fastest curve, let us introduce a partial 
ordering < on ((po,i )) x ({i°,qo)) by taking (p',q') < (p",q") iff ((p',q'}} Q 
{(p",q")) (i.e. iff p' G J + (p") and q' G J~(q")). Then, defining F as the set of 
pairs of points in ({po, i )) x ((i°,qo)) which are joined by a causal curve through 
V, let (p, q) G F be any element of the closure of F which is minimal with respect 
to <. 

By the definition of p and q, there exists a sequence of points Po?Pi?P2, • • • 
ascending A - to p, and a sequence qo,q\,q2, ■ ■ ■ descending A + to q, such that 
each pair (p n ,qn) is joined by a causal curve 7 n through V. We claim (passing 
to a subsequence if necessary) that the curves 7 n approach a limit 7, as depicted 
in fig. (4). This follows from the known fact that, for arbitrary compact subsets 
A and B of any globally hyperbolic set, the space C(A, B) of causal curves from 
A to B is compact. (Here the topology of C(A, B) can be taken to be the so- 
called Vietoris topology, as will be shown in a forthcoming paper^ 30 ) wherein 
the existence of limit curves like 7 will be proved by a new method.) Thus, since 
((POiP)) an d ((<?, qo)) are compact subsets of the globally hyperbolic set DUZ, the 
sequence {7 n } must possess within C(((po,p)),((q,qo))) an accumulation point, 
7, as claimed. By construction, 7 is a fastest causal curve from p to A + , in the 
sense that no causal curve from p (and which enters AA) can arrive earlier on A + . 
In consequence, 7 must remain always on the boundary of J + (p) , whence it must 
be a null geodesic without conjugate points (if it enters AA at all).*- 7 ' 8 ' 31 -' 

The curve 7 will be our self-contradictory null geodesic if we can rule out 
the possibility that the 7 n "escape to infinity" (in which case we would have 
7= ((p, i°))U((i°, q)) C A~U{z°}UA + , which never even enters AA). In excluding 
this possibility, one may distinguish two cases: either an infinite number of the 
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7n meet the compact set K (defined above) or they do not. In the former case 
7 = lini7 n must also meet K since, were it to lie in its complement K' := M\K 
(an open set), the 7 n would eventually also have to lie in K' according to the 
definition of the topology of C{A, B). Thus 7 must enter M, as desired. 

Passing to the latter case, then, we may assume (by possibly omitting a 
finite number of curves from the sequence) that none of the 7 n meet K or, in 
other words, that (except for their endpoints on A^) they lie entirely within 
the subset O C V, O = {x E ((po,qo)) \Ro < r(x) < 00}. But our conditions 
of asymptotic flatness imply that, within O (which is contained in the compact 
set ((po;<7o))); the inequalities (III. 2. 2) hold with fixed bounds and we can 
assume that the R defining K has been chosen at least as large as that occurring 
in Lemma III. 2.1. The conditions for invoking Lemma III.2.1 are then met, except 
for the fact that the 7 n are not necessarily null geodesies. To remedy this, let us 
find* on each 7^ a point s n at which the ^-coordinate vanishes, and having found 
it, modify "f n so that it becomes a union of "fastest curves" 7^ from s n to A^. 
Such fastest curves 7^ can be constructed just as before for 7 itself, and must 
be null geodesies for the same reason.** Since they are fastest, the replacement 
In — ► 7n U 7+ makes 7 n faster than it was, and accordingly does not change the 
fact that the modified endpoints p n and q n also converge to the same limits p 
and q. Thus we can assume without loss of generality that each 7 n has the form 
7~ U 7+, where 7^ is a null geodesic from s n to A^. Notice finally that, again as 
before, we can also reduce ourselves to the case where all of the 7 n remain within 

a 

Invoking Lemma III. 2.1, we thus can conclude that, for M + + M~ < 0, the 
time-of-flight of 7™ from A~ to A + is bounded below by 

-2(M+ + M-)log-^-C , (IV.l) 

where b n is the value of ^/ x 2 + y 2 at s n and Co is a constant. In other words, 
the difference between the retarded time coordinate of q n and the advanced time 
coordinate of p n is at least that given by expression (IV.l). 

Assume then that M + +M~ < 0. If the b n diverged to +00, then expression 
(IV.l) would eventually become greater than the time-of-flight between p and 

* Another approach would be to broaden Lemma III. 2.1 to apply to arbitrary causal curves. 

** Alternatively, their existence follows from the fact that dJ ± (s n ) must contain a ruling null 
geodesic through q n (respectively p n ). 



19 



qo, contradicting the definition of p n and q n ; hence there is some upper bound 
Ri which the b n cannot exceed. But this means precisely that the 7 n all meet 
the compact set JC\ defined like /C, but with R\ replacing Rq or, to put it slightly 
differently, that we can choose Rq large enough so that all of the 7 n , and therefore 
7 itself, meet the compact set /CCD. 

We thus conclude that, unless M + + M~ > 0, 7 must enter V, in which 
case it (or more precisely its intersection with V) will be an infinite null geodesic 
without conjugate points, providing the desired contradiction. Appealing to the 
appendix in order to replace M + +M~ by —IP ■ k, we can conclude, finally, that 
the total 4-momentum P a must obey P a k a < 0, or, since any future-null vector 
could have been chosen as k a , that P a must be future-causal (i.e. future-timelike 
or -null). □ 

Remark: One may wonder what happens when k ■ P < 0, such as for a positive 
mass Schwarzschild metric. In that case, one cannot confine the 7 n to meet any 
compact subset of spacetime; in fact, the sequences {p n } and {q n } will converge 
to i°, each curve 7 n having successively larger impact parameter, and the limit 
curve 7 will be the "null geodesic" joining i° to itself! (In particular, for any 
p El~, I + (p) contains all of J + when P a is future-timelike.) 

To make a possibly greater appeal to intuition, it may be useful to make the point 
another way.( 13 ^ The formula? of Section III may be used to evaluate implicitly the z- 
coordinate at which a given null geodesic with endpoint at p G X~ meets a particular 
Cauchy surface parametrised by t, simply by setting 5 = constant. Then the methods of 
Section III. 2 yield 

*(«) -*, = «- 2 7 4. I^MhlA + 0(1) , (IV.2) 
J i- (i 

zo/ro 

where 0(1) signifies order one (bounded) in b. One now may ask what would be the shape 
of the wavefront defined by the meeting of a surface ruled by null geodesies of constant 
advanced time with a Cauchy surface of fixed t. To answer this, compute (dz/db) s - ;t . 
The analysis follows as before. One takes the limit to —> —00, zq — > —00, hq — > —1, which 
places the vertex of the light cone atpGl". To 0(l/b), one obtains 

which in turn implies that the wavefront formed by the null geodesies from p G X~ meets 
the chosen Cauchy surface (say t = 0) in a two-surface that is asymptotically of the form 

z = 2M~ logb + 0(1) . (IV.4) 
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Similarly, a wavefront of past-directed null geodesies from q E 1 + would also form a 
logarithmic surface, this time described by 

z = -2M+log6 + C(l) , (IV.5) 

obtained by holding the retarded time function constant. The condition that q be in the 
causal future of p is the condition that these two wavefronts meet, for which it suffices 
that M+ + M~ > (i.e. that k • P < 0). When this inequality holds, the wavefronts 
are "swept forward near infinity" and necessarily overlap there, whence q is actually in 
the chronological future of p. When the inequality holds in the opposite direction (so that 
k ■ P > 0), then the wavefronts are "swept back near infinity" and, if they meet at all, it 
is not within some neighbourhood of infinity, which is the key point. Choosing p and q so 
that the wavefronts "just touch" gives a "fastest null geodesic" 7, as before, and thence 
the contradiction. These wavefronts are depicted in fig. (5), with the Minkowski space 
case given there as well. Fig. (6) depicts the resulting set 2 + \I + (p) for M = (and for 
M < 0). 

Finally, there is a third version of the argument, which is somewhat closer to that 
of Ref. (12) and which avoids the introduction of causal curves which are not geodesic. 
In this version, one begins with p G A - and chooses a sequence of finite points p n G V 
descending a causal curve to p. Because p n is not at infinity, the boundary of its future 
will meet A + at q n (say) and will contain a null geodesic 7 n from p n to q n with no finite 
pair of conjugate points. (Proof: One knows that some past null geodesic ruling dJ + (p n ) 
originates at q n and can leave dJ + (p n ) only at p n .) One then deduces, along the lines of 
the analysis of Section III, that, for P • k > 0, the initial tangent vectors to the 7 n must 
avoid a neighbourhood of the future direction along A - in order that the q n not recede into 
the infinite future along A + (which they cannot do, since J + (p n ) Q J + (Pn+i) )• Obtaining 
a limiting geodesic 7 as before, one concludes that its tangent is not along A - (whence it 
enters M) and that it is also free of conjugate points (since a limit of conjugate-point-free 
null geodesies is also such). This is the desired contradiction. 

V: Final Observations 

In the introduction it was mentioned that one potential application of this theorem would 
be to semi-classical gravity. Another potential application would be to spacetimes pos- 
sessing closed timelike curves shrouded within event horizons. Most singularity theorems 
break down in the presence of acausalities, and Finkelstein has suggested that causality 
violations might be found instead of singularities, were one to investigate the interior of 
certain black holes. In such a situation there might be no edgeless spacelike hypersurfaces, 
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but Theorem II. 1.1 would still apply as long as the region outside the hole were globally 
hyperbolic. 

Returning to the semi-classical situation for a moment, there is a potential extension 
of Theorem II. 1.1 which would also be of interest in that context. Classically, the fact 
that the energy is bounded below is often taken to imply the stability of flat space, and an 
extension of this fact to semi-classical gravity* would be a step toward proving stability in 
the fully quantum case. However the result presented here does not deal with the case of 
zero mass. For completeness, one would like such an extension even in the classical case, 
of course, but it has a greater potential interest semi-classically. There, unless one can 
rule out the existence of non-flat solutions of zero energy, one leaves open the interesting 
possibility of inequivalent ground states between which tunnelling would be expected to 
occur. 

It seems that any P a = result would be obtained most naturally in conjunction with 
a further generalisation, a proof that the 4- momentum cannot be null. The reason is that, 
in both situations, one would be dealing with the case P ■ k = for an appropriate k a ; in 
the P a = case any choice of k a will do, while in the other case one chooses k a parallel 
to P a . For such a k a , k a k b h a b falls off faster than C(l/r), say perhaps as 0(l/r 2 ). If one 
considers null geodesies originating at p on the generator of 1~ corresponding to this k a , 
the result of this fall-off rate is to modify the relative delay of neighbouring null geodesies 
at large b. In analogy with equation (IV. 3), one obtains 

Bz 

- = G(l/6 2 ) . (V.l) 

When one integrates this, one sees that the relative time delay of different null geodesies 
(at least, ones that propagate so as to remain always near J) vanishes as 0{l/b), so the 
null geodesies from p now will form a sequence whose large b limit arrives at X + at finite 
retarded time U (equal to the retarded time of arrival one would compute by ignoring h a b 
and using the flat metric rj a b).** If any of the null geodesies from p with finite b arrive at 
earlier retarded times than U, then the earliest of these arrivals is a null geodesic ruling 
the boundary of I + (p) and so cannot have focused, whence the theorem applies. However, 
it is conceivable that no geodesic arrives earlier than the limiting time U, in which case 
there is no apparent contradiction. 

* Once again, the reference here is to stability in the context of the semi-classical Einstein 

equations. It is not only to the more limited question of the presence or absence of instanton 

solutions tunneling to classical states of lower energy, which of course are already ruled 

out by the classical positivity of energy. 
** A special case of this behaviour is Minkowski space, where all null geodesies from any fixed 

p G X~ arrive at X + at exactly the same retarded time. 
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This would seem unfortunate, and one might hope that there may be more subtle con- 
tradictions that emerge in this case because it seems reasonable to hope that the methods 
applied herein should yield results known to be true from the earlier techniques. However, 
there is also a contrary indication that perhaps any such extension of the present methods 
would prove too much! Specifically, postulate for a moment the existence of N additional 
non-compact spatial dimensions. Then, roughly speaking, finiteness of the ADM mass 
requires that the metric behave as 

g ab = Va b + 0(l/r 1+N ) , (V.2) 

which, again in analogy with equation (IV. 3), yields 

| = 0(l/b^ N ) , (V.3) 

even for future-timelike P a . Again, the integral of this shows that the endpoints of null 
geodesies of successively larger b approach a finite limiting value of retarded time on X + , 
precisely mimicking the behaviour in the usual three spatial dimensions when P a = 0. 
There is thus a danger that any proof excluding P a = in 3+1-dimensions would also 
erroneously exclude known asymptotically flat solutions with P a ^ in higher dimensions. 

Another possible line of generalisation which might be instructive to pursue concerns 
alternative gravity theories, or theories with compactified extra dimensions. In some of 
those theories, positive energy theorems have been proved, and an attempt to reproduce 
them with the methods of this paper would help disclose the extent to which Theorem 
II. 1.1 depends on special features of Einstein gravity in four dimensions. For example, 
in 5-dimensional Kaluza-Klein theory, the total energy is positive for some choices of 
spatial topology but of indefinite sign in others, whereas it is not apparent how the general 
ingredients entering into the present proof, such as the focusing of null geodesies, would be 
affected by the dimensionality, or how they could be sensitive to the topology in this way.t 

t The zero-energy solution discussed by Witten^ 32 ) provides one interesting example. Al- 
though one might think that a theorem analogous to ours ought to rule out such metrics 
(or really the negative energy solutions with which it is associated^ 33 -*), it turns out that 
spacetime of reference (32) contains a "bubble of nothingness which grows at the speed of 
light" , and our conditions of asymptotic flatness would not be satisfied in the presence of 
such a disturbance. Kaluza-Klein examples wherein the topological character of X can be 
affected in other interesting ways also exist, such as the monopole solution, whose twisted 
character as a circle bundle would seem to lead to an extended spacetime M, which is not 
even a manifold^ 34 ) . 
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Similarly, in Kaluza-Klein and other theories, the inertial mass is typically less closely 
tied to the "active gravitational mass" than it is in General Relativity, and the purely 
transverse character of radiation can also be lost — both of these being elements on which 
the present method of proof directly depends. 

Finally, it seems likely this proof may be generalised to 4-dimensional metrics with 
slower fall-offs rates. Although the fall-off conditions assumed herein seem fairly weak, it 
is hoped that the method of proof will apply as well to metrics with fall-off rates as slow 
as l/r^ +e , < e < 1/2, on approach to i°, for such a rate is sufficient to allow for the 
definition of a convergent mass integral. With such fall-offs, one might expect the b — > oo 
logarithmic divergences encountered in Section IV to be replaced by even stronger power 
law divergences, so it seems reasonable to hope that the method of proof used would apply 
in those cases as well. 
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Appendix: Superposed Asymptotically Schwarzschild Metrics 

With respect to a flat background metric rj a b, the Schwarzschild metric may be written 
in isotropic form as 



9ab = 



1+ mv /i^e- 



MY 

U r 'i, l( .U"ii M + ( 1 + — J 'r h b , (A.l) 



where rj ab U a U b = -1, and p 2 = rj ab X a X b + (rj ab X a U b ) 2 . The 4-momentum of the solution 
is P a = MU a . By convention, U a will be future-directed, so a negative value of P° means 
a negative value of M. Asymptotically, this metric is 



dab 



2M\ AM A 

1 + )Vab+ U c r] ac U d r)b d 

P J P 



(A.2) 
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From equation (A. 2) it is easy to write down the superposition of an arbitrary number 
of asymptotic Schwarzschild metrics indexed by A. One obtains 

a -r. | n ^ M A i lah + 2r lac U c A i lhd U d A 

9ab —Vab + * > J • 1 A -<JJ 

A P A 

For definiteness, the superposition has been written as a discrete sum, but more generally 
one could include a continuous integral as well (and possibly still more general sorts of 
superposition). Because the Ma are not restricted to be positive, a superposition of this 
sort may possess any value of P a , including spacelike and negative timelike ones. 

For the sake of deriving equation (A. 8) below, we assume that a superposition like 
(A.3) can serve, on approach to spatial infinity, to describe the most general behaviour of 
the metric at 0(l/r) or, in other words, that such a form captures the asymptotic metric 
which remains after radiation terms have been excluded. Correspondingly, we reason in the 
main text as if the most general DC metric can be taken to be of this form, and we limit 
the metric perturbation m^/r of expression (III. 2.1) accordingly. While this could be 
adopted as part of our definition of asymptotic flatness, we suspect that such a limitation 
would be merely a convenient short-cut which could be avoided at the cost of some extra 
work needed to derive (A. 8) directly from fall-off conditions together with the asymptotic 
field equations. 

Now compute M ± , as defined by equations (III.2.1), (III.2.10), and (III.2.14), for the 
metric (A.3) and the (future-pointing) null vector k a . By definition, this is just the limit 
as A — * ±oo of I j-rh ab (x) k a k b , where (x — xo) a = Xk a and r is the radial coordinate of x. 
Since M + is linear in h ab , it may be computed for (A. 2) alone. Denoting the vector d/dt 
by e a and recalling that k ■ k = 7] a bk a k b = 0, one has 

P 2 =7]abX a x b + (U ■ x) 2 

=(k ■ U) 2 \ 2 + 0(A) (A.4) 
=>. p = - \k ■ U + C(A°) , 

and analogously 

r = -Afc • e + 0(A°) . (A.5) 
(Here O(X ) means boundedness in A.) Hence, to leading order in 1/r and 1/A, we have 

h ab k a k b =( V ab + h ab )k a k b = g ab k a k b 

=g ab k a k b = {l + —)(k-k) + —(U-k) 2 

4M ^ 2 -i&.e [ (A.6) 



-X(U-k) r 
M+ = M~ =M(U-k)(k-e) = (P-k)(k-e) = -k°P-k . (A.7) 
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Therefore, if k° = 1 (as in the main text), then 



M+ + M~ = -2P-k , (A.8) 

for (A. 2), and hence for (A. 3). This may be inserted into the results of Section III and 
used in the proof of Theorem II. 1.1 presented in Section IV. 

In connection with Section III. 2, it may be illuminating to consider an example consist- 
ing of the addition of two asymptotic metrics of the above form. For sake of the example, 
the result of this addition will be expressed in a frame where one of the two constituent 
metrics is time-independent. In this frame, say that the other metric is boosted so that it 
has 4-velocity proportional to (1, 0, 0, V) with | V\ < 1. Let R 2 = X k X k with k G {1, 2, 3}. 
Then the result is 



ds 2 =h+ 2Ma 



A=l,2 R 



T^ + (T-V A Z) 2 ]~ 1/2 



Vab dX a dX l 



+ 7^1 MidT 2 + 



R 2 R 2 (l-V 2 ) 

M 2 (dT 2 -2VdZdT + V 2 dZ 2 ) \ (A.9) 

RV" 1 ™ ' VT^V 1 TP T 2 /R 2 )(l - V 2 ) + (T/R - VZ/R) 2 



where V\ = and Vi = V. 

Consider two cases, one where the null geodesies of Section III propagate perpendicular 
to the boost velocity and one where they propagate parallel to the boost velocity. This 
corresponds to two different choices of generator of I~ on which to place the initial point 
from which the null geodesies emanate. One may read off the function M{T/R, 9, ji). The 
two cases give 

MAT/R^)=M l + -^=! [ (l-V 2 )(l-^) + (^-V^ (A.lOa) 
M (77/?,//) = AL + , - , (1 - VYl (\ r J !i I 



I (T/R, ,) =M t + (1 - V) 2 { (1 - V 2 ) (l - T - 2 



Taking the limits to X ± , one obtains 



M 2 



M+ + M-=2^M 1 + -^=j , (A.lla) 

m ii +M ir= 2 ( Mi+ ^7iW) • (A - llb) 
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Lastly, since the ADM 4-momentum of this configuration is given by adding the 4- 
momenta of the two constituent solutions, one boosted and one stationary, then 



p a = (Mi + 



M 2 



,0,0, 



M 2 V 



) 



(A.12) 



In agreement with (A. 8) this results in 



M+ + M~=-2k ± -P 
M+ + M~ = - 2k\\ ■ P 



(A.13a) 
(A.13b) 



where kj_ and fcy are future-pointing Minkowskian-null vectors normalised so that k° = +1 
in both cases. The spatial components of kj_ are orthogonal to those of the boost vector 
U a while those of ku are parallel. 
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Figure Captions 



We regret we cannot provide electronic versions of the figures. 

Fig. (la): Focusing of null geodesies. The geodesies enter the crossing region and leave the 
boundary of the future of the initial point, en route to the conjugate points they will 
eventually encounter when they reach the caustics. 

Fig. (lb): Focusing of null geodesies. A single pair of geodesies is shown. At q, they cross and 
leave the boundary of the future of p, but they must obey local causality, so they 
remain (for awhile) on the boundary of the future of q. They will eventually reach 
the caustics. 

Fig. (2) : An example of the Domain of Outer Communications of a particular asymptotic region 
of some solution. 

Fig. (3): A conformal compactification with i° properly displayed as a single point. Spacetime 
is exterior to the cones which represent infinity. From p e A - , one may reach any 
point of any generator of X + , except A + , by a causal curve in spacetime produced by 
smoothing out the curve that is the union of A - with {i } and with the appropriate 
generator of X + . Such a smooth causal curve in spacetime is displayed as a dotted 
line here joining p to q'. 

Fig. (4): The sequence of curves {7™} used in the proof of Theorem II. 1.1, for the case M < 0. 

Fig. (5a): The t = surface in Minkowski space. In flat space, parallel lines, here defined by null 
geodesies incident on plane wavefronts, meet at infinity and so define the points of X^. 
If the two 2-plane wavefronts shown lying in this Cauchy surface are truly parallel, 
it is clear they define points of X ± that are not in causal contact with each other. If 
they are not parallel, they must meet somewhere, and causal contact is established. 
This gives rise to the "missing- half-generator" of fig. (6). 

Fig. (5b): A spacelike surface when M > 0. Now curvature bends geodesies en route to/from 
J, producing "logarithmic wavefronts" that always meet. The augmentations that 
appear on the wavefronts are the result of the focusing depicted in fig. (1). 

Fig. (5c): A spacelike surface when M < 0. The logarithms of fig. (5b) are reversed. Again, 
wavefronts may fail to meet. Because the fall-off near infinity is only logarithmic in 
the impact parameter, only one half-generator of X + is excluded from the future of 
any chosen point in X~ , as depicted in fig. (6). 

Fig. (6): Compactified Minkowski space displayed conventionally, so that i is not correctly 
represented as a single point. This diagram is also appropriate to M < space. The 
accented half-generator of X + is the set of all points in X + \I + (p). Note that i° is 
incorrectly represented — it should be a single point. 



29 



